Self-organizing Map Teams up with the Buddy System

نویسنده

  • Dillon McTernan
چکیده

The Self-Organizing Map is a vital part of today's research towards unsupervised learning in Neural Networking. There are many commercial and scholarly applications for Self-Organizing Maps within data analysis, topological visualization, and clustering. In this paper we address the functionality of the Self-Organizing Map in its tendency towards greediness. It is found that the common solution to this issue is through an antigreedy function called the Block Party. This anti-greedy function was unable to satisfy my desired requirements; thus the objective of creating a new anti-greedy function produced the Buddy System function. This paper compares these two anti-greedy functions and the possible advantages for this new Buddy System function. INTRODUCTION Artificial Neural Networking is a considerably new and highly advanced field of study where biological neural networks are reconstructed into computational and mathematical models using supervised, unsupervised, and reinforced learning. The SelfOrganizing map(SOM) uses unsupervised learning to position weighted neurons in the vicinity of input data vectors as means to model the input data space. This placement is based upon matching the closest weight vector to the vector taken from the input data space. These best matching neurons are called winning neurons (WN). The new location of the WN is based upon a formula that analyzes the relationship between the WN and the input data vector. The idea of greediness occurs in the SOM when any neuron is not activated as a WN. This conflict will not allow an accurate distribution of the inactivated neuron, therefore producing a poor representation of the sample input data. Anti-greedy functions such as the Block Party were established to minimize this error [4]. Their general solution to this problem is to establish connections between the WN and its surrounding neurons. Thus when the WN moves closer to the input data vector, its neuron neighbors will also move in the same direction though with lesser intensity. The Block Party establishes the WN neighbors by initializing the neurons in a grid pattern where upon it chooses the neighbors with a block formation. A problem arose when desired requirements called for randomized placement for the initialization of the neurons. This lead to the creation of an anti-greedy function called the Buddy System. This new antigreedy function uses Euclidean distance to find a WN's neighbors and therefore negates the necessity for a pre-designed pattern for the initialization of neurons. The next section, BACKGROUND, will overview a brief history of beginning Neural Networks leading up to the Buddy System. Following will be a section on functionality where upon we will discuss the internal structure of the SOM, Block Party, and Buddy System as well as the problems and solutions of the Buddy System and its practical applications. The performance of the Buddy System will be evaluated in RESULTS and will be followed with the conclusion and references. Since these findings of the Buddy System were established in two dimensional space, we will be continuing the description of the SOM, Block Party, and Buddy System, in its entirety, in the two dimensional context. BACKGROUND Artificial Neural Networks became part of public awareness in the 1960's under the term perceptrons. In 1969 Minsky and Papert published a book that described the attributes of single layer perceptrons. These findings on the single layer perceptrons showed its characteristics to have drastic limitations in basic pattern recognition [5]. These findings where a devastating blow to the field of Artificial Neural Networking and created a loss of interest in this field [5]. It wasn't until the 1980's that this field became popular again as the discovery of trained multilevel perceptrons proved to accomplish the problematic pattern recognitions of single layer perceptrons [5]. The trained multilevel perceptrons created the idea of supervised learning, and in conjunction, it brought up the quandary of unsupervised learning. It was in the early 1980's when Tevo Kohonen helped to establish the idea of unsupervised learning with the invention of the Self-Organizing Map(SOM) [3]. One of the major dilemmas with his SOM was an issue of greediness within his algorithm, where upon the creation the block party function was established to minimalize this issue [4]. Though the Block Party function lends adequate assistance to the SOM, there lacked a sense of randomness in the initialization of the neurons. Thus, a new anti-greedy function, called the Buddy system, was created and established a randomized initialization of neurons for the SOM. FUNCTIONALITY The SOM starts with gathering a rectangular boundary around the input coordinates. It then starts to initialize the neurons with x,y coordinates from inside the boundary. After the initialization of the neurons, the SOM selects the first input data point in a list and finds the closest neuron to that data point, where upon it assigns it as the WN. It then uses the formula: WN[new_location] = WN[old_location] + ((DataPointWN[old_location])*epsilon ) to establish the new location of the WN.[2] Note that epsilon is a monotonically decreasing learning coefficient. This procedure continues down the list for every data point in our sample. During this procedure of repeatedly going over the list of data points, the SOM compares the neurons' current location to its previous location. If they have all moved less than the “margin of error,” then the SOM stops; otherwise it continues until such requirements have been met. The Block Party function works along side the SOM, so the chance of a neuron not being set as a WN, or even moved at all, will be minuscule. This function informs the SOM to initialize the neurons in a grid pattern within the input boundary. The function then assigns the neurons that form a block around each neuron as its neighbors. So that when a neuron becomes a WN it has a block of neighbors that follow it . The formula in which the neighbors use to move is the same as the WN except that the epsilon is significantly less, thus moving with a lesser intensity. The Buddy System works with the SOM on similar principles as the Block Party, except that it informs the SOM to initialize the neurons to random coordinates within the sample boundary. The way that the Buddy System creates a neighborhood of neurons is by first establishing how many neighbors will belong to a WN and then finding that number of the closest neurons to set as neighbors. Like the Block Party, the Buddy system creates the neighborhoods before the SOM starts processing the sample data, keeping the neighborhoods static. Again the neighbors of the WN will use the same formula as the WN except that the epsilon will be significantly less. Problems and solutions When creating the Buddy System it was found that on occasional examples there were neurons that did not move during the completed process. It was after many examples that brought the attention of a major difference between the Block Party and the Buddy System. Due to the grid formation of the Buddy system it was guaranteed that each WN would have a neighbor and that each neuron would be part of a neighborhood. [1] Even though each WN had a set amount of neighbors in the Buddy System, it was not guaranteed that each neuron was part of a neighborhood. The solution to this problem was to set the amount of neighbors to be one less than the amount of total neurons, thus guaranteeing every neuron to function as a neighbor or expanding a neuron's neighborhood to incorporate this lost neuron. The later solution builds on the idea of allowing these neighborhoods to become dynamic if necessary. The idea is that if the Buddy System function notices a lost neuron, it will add it to its closest moving neighbor's neighborhood, thus dynamically expanding this neighbor's neighborhood to incorporate a lost neuron. Application One of the greatest applications of the Buddy System is that if one is able to randomly initialize neurons on a coordinate plane, then one can initialize the neurons to any desired location. The reason that this would be beneficial is if one would want to take note of each neuron's distance from its beginning location to its final location. A practical application of this would be a telecommunications company looking to update the location of their cell towers to meet the needs of their client base; consider the neurons to be cell phone towers and the sample input data to be an updated population survey. The telecommunications company can use the Buddy System along with a SOM to retrieve the cell tower's current location and its potential location that would best suit this new population survey. This will allow them to accurately analyze data for essential documentation such as a cost versus value report or a feasibility report.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Estimation of Missing Daily Precipitation and Runoff Using Self-Organizing Map (A Case Study: Mazandaran Province)

Expert aquatic designers face many problems; among these, in hydrology, defective occurrences in time-series can cause errors in the ultimate results of the study. This more often happens in the regions where the number of hydrometric and rain gauge stations is limited. In addition, assessing, developing and maintaining the use of water resources require accessible long-term and high-quality qu...

متن کامل

Uncertainty Modeling of a Group Tourism Recommendation System Based on Pearson Similarity Criteria, Bayesian Network and Self-Organizing Map Clustering Algorithm

Group tourism is one of the most important tasks in tourist recommender systems. These systems, despite of the potential contradictions among the group's tastes, seek to provide joint suggestions to all members of the group, and propose recommendations that would allow the satisfaction of a group of users rather than individual user satisfaction. Another issue that has received less attention i...

متن کامل

A Modfied Self-organizing Map Neural Network to Recognize Multi-font Printed Persian Numerals (RESEARCH NOTE)

This paper proposes a new method to distinguish the printed digits, regardless of font and size, using neural networks.Unlike our proposed method, existing neural network based techniques are only able to recognize the trained fonts. These methods need a large database containing digits in various fonts. New fonts are often introduced to the public, which may not be truly recognized by the Opti...

متن کامل

Landforms identification using neural network-self organizing map and SRTM data

During an 11 days mission in February 2000 the Shuttle Radar Topography Mission (SRTM) collected data over 80% of the Earth's land surface, for all areas between 60 degrees N and 56 degrees S latitude. Since SRTM data became available, many studies utilized them for application in topography and morphometric landscape analysis. Exploiting SRTM data for recognition and extraction of topographic ...

متن کامل

Gait Based Vertical Ground Reaction Force Analysis for Parkinson’s Disease Diagnosis Using Self Organizing Map

The aim of this work is to use Self Organizing Map (SOM) for clustering of locomotion kinetic characteristics in normal and Parkinson’s disease. The classification and analysis of the kinematic characteristics of human locomotion has been greatly increased by the use of artificial neural networks in recent years. The proposed methodology aims at overcoming the constraints of traditional analysi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009